A Parallel Document Retrieval Server For The World Wide Web
Abstract
An architecture is proposed which enables the Parallel Document Retrieval Engine (PADRE), running on a single-user Fujitsu AP1000 multicomputer, to operate as an information server on the World Wide Web. The advantages and disadvantages of a distributed memory parallel machine for this purpose are discussed and the likely applicability to different types of parallel machine is considered. Ideas for a range of types of remote query-generation client are outlined and measurements of query processing speed are reported, shedding some light on potential load handling capacity of this parallel server.
Similar resources
WWW Search Systems Using SQL*TextRetrieval and Parallel Server for Structured and Unstructured Data
We describe our experience in developing Web Search Systems using Oracle's SQL*TextRetrieval. In the prototype system we store on-line books in HTML and the HTML documents of a web site; SQL*TextRetrieval is used to index full text and other structured data in the 'web space' and to provide an efficient search engine for free-text search. The Web enables global access to and maximum informa...
Discovering Parallel Text from the World Wide Web
A parallel corpus is a rich linguistic resource for various multilingual text management tasks, including cross-lingual text retrieval, multilingual computational linguistics and multilingual text mining. Constructing a parallel corpus requires effective alignment of parallel documents. In this paper, we develop a parallel page identification system for identifying and aligning parallel documents ...
A Toolkit for Analyzing Client Access Patterns in the World-Wide Web
The explosive growth of the World-Wide Web increases demand for improved facilities for caching and replication. Better understanding of user access patterns is a prerequisite for improvement. Most wide-area information servers are instrumented to collect statistics about document retrieval. However, server-side tracing offers a limited picture of client access patterns and navigation strategie...
The Effectiveness of Cache Coherence Implemented on the Web
The popularity of the World Wide Web (Web) has generated so much network traffic that it has increased concerns as to how the Internet will scale to meet future demand. The increased population of users and the large size of files being transmitted have resulted in concerns for different types of Internet users. Server administrators want a manageable load on their servers. Network administrato...